Basic Statistics

Raw Counts

Name Value
Rows 233,154
Columns 30
Discrete columns 4
Continuous columns 26
All missing columns 0
Missing observations 0
Complete Rows 233,154
Total observations 6,994,620
Memory allocation 53.4 Mb

Percentages

Data Structure

root (Classes 'data.table' and 'data.frame': 233154 obs. of 30 variables:)UNIQUEID (num)DISBURSED_AMOUNT (num)ASSET_COST (num)LTV (num)BRANCH_ID (num)SUPPLIER_ID (num)MANUFACTURER_ID (num)CURRENT_PINCODE_ID (num)EMPLOYMENT_TYPE (chr)STATE_ID (num)EMPLOYEE_CODE_ID (num)AADHAR_FLAG (num)PAN_FLAG (num)VOTERID_FLAG (num)PERFORM_CNS_SCORE (num)PERFORM_CNS_SCORE_DESCRIPTION (chr)PRI_NO_OF_ACCTS (num)PRI_ACTIVE_ACCTS (num)PRI_OVERDUE_ACCTS (num)PRI_CURRENT_BALANCE (num)PRI_SANCTIONED_AMOUNT (num)PRI_DISBURSED_AMOUNT (num)PRIMARY_INSTAL_AMT (num)NEW_ACCTS_IN_LAST_SIX_MONTHS (num)DELINQUENT_ACCTS_IN_LAST_SIX_MONTHS (num)NO_OF_INQUIRIES (num)LOAN_DEFAULT (num)formatted_DATE_OF_BIRTH (Date, format)formatted_DISBURSAL_DATE (Date, format)AGE (num)

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 2 columns ignored with more than 50 categories.
## formatted_DATE_OF_BIRTH: 15433 categories
## formatted_DISBURSAL_DATE: 84 categories

QQ Plot

Correlation Analysis

## 2 features with more than 20 categories ignored!
## formatted_DATE_OF_BIRTH: 15433 categories
## formatted_DISBURSAL_DATE: 84 categories

Principal Component Analysis

## 2 features with more than 50 categories ignored!
## formatted_DATE_OF_BIRTH: 15433 categories
## formatted_DISBURSAL_DATE: 84 categories